Unify benchmark file download logic #7462
Signed-off-by: Robert Kruszewski <github@robertk.io>
Benchmarks: PolarSignals Profiling
Vortex (geomean): 1.006x ➖
datafusion / vortex-file-compressed (1.006x ➖, 0↑ 0↓)

File Sizes: PolarSignals Profiling
No file size changes detected.

Benchmarks: FineWeb NVMe
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (0.922x ➖, 3↑ 0↓)
datafusion / vortex-compact (0.934x ➖, 2↑ 0↓)
datafusion / parquet (0.944x ➖, 0↑ 0↓)
duckdb / vortex-file-compressed (1.001x ➖, 0↑ 1↓)
duckdb / vortex-compact (0.938x ➖, 1↑ 0↓)
duckdb / parquet (0.966x ➖, 1↑ 0↓)

File Sizes: FineWeb NVMe
No file size changes detected.

Benchmarks: TPC-H SF=1 on NVME
Verdict: No clear signal (environment too noisy confidence)
datafusion / vortex-file-compressed (0.868x ✅, 17↑ 0↓)
datafusion / vortex-compact (0.874x ✅, 20↑ 0↓)
datafusion / parquet (0.868x ✅, 13↑ 0↓)
datafusion / arrow (0.949x ➖, 2↑ 0↓)
duckdb / vortex-file-compressed (0.869x ✅, 20↑ 0↓)
duckdb / vortex-compact (0.923x ➖, 11↑ 0↓)
duckdb / parquet (0.939x ➖, 6↑ 2↓)
duckdb / duckdb (0.990x ➖, 1↑ 0↓)

File Sizes: TPC-H SF=1 on NVME
No file size changes detected.

Benchmarks: TPC-DS SF=1 on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (1.013x ➖, 0↑ 0↓)
datafusion / vortex-compact (1.007x ➖, 0↑ 0↓)
datafusion / parquet (1.013x ➖, 1↑ 1↓)
duckdb / vortex-file-compressed (1.023x ➖, 0↑ 4↓)
duckdb / vortex-compact (1.011x ➖, 2↑ 1↓)
duckdb / parquet (1.007x ➖, 0↑ 2↓)
duckdb / duckdb (1.012x ➖, 1↑ 1↓)

File Sizes: TPC-DS SF=1 on NVME
No file size changes detected.

Benchmarks: FineWeb S3
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (1.039x ➖, 0↑ 1↓)
datafusion / vortex-compact (0.998x ➖, 0↑ 0↓)
datafusion / parquet (0.952x ➖, 0↑ 0↓)
duckdb / vortex-file-compressed (0.958x ➖, 0↑ 0↓)
duckdb / vortex-compact (0.980x ➖, 0↑ 0↓)
duckdb / parquet (0.970x ➖, 0↑ 0↓)

Benchmarks: Random Access
Vortex (geomean): 0.864x ✅
unknown / unknown (0.956x ➖, 6↑ 4↓)

Benchmarks: TPC-H SF=10 on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (0.989x ➖, 0↑ 0↓)
datafusion / vortex-compact (0.954x ➖, 3↑ 0↓)
datafusion / parquet (1.003x ➖, 0↑ 0↓)
datafusion / arrow (0.991x ➖, 0↑ 0↓)
duckdb / vortex-file-compressed (0.790x ✅, 22↑ 0↓)
duckdb / vortex-compact (0.832x ✅, 22↑ 0↓)
duckdb / parquet (0.879x ✅, 18↑ 0↓)
duckdb / duckdb (0.941x ➖, 8↑ 0↓)

File Sizes: TPC-H SF=10 on NVME
No file size changes detected.

Benchmarks: Statistical and Population Genetics
Verdict: No clear signal (low confidence)
duckdb / vortex-file-compressed (0.962x ➖, 1↑ 0↓)
duckdb / vortex-compact (0.976x ➖, 0↑ 0↓)
duckdb / parquet (0.967x ➖, 0↑ 0↓)

File Sizes: Statistical and Population Genetics
No file size changes detected.

Benchmarks: Clickbench on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (0.891x ✅, 19↑ 2↓)
datafusion / parquet (0.950x ➖, 6↑ 0↓)
duckdb / vortex-file-compressed (0.903x ➖, 19↑ 1↓)
duckdb / parquet (0.922x ➖, 14↑ 0↓)
duckdb / duckdb (0.991x ➖, 4↑ 2↓)

File Sizes: Clickbench on NVME
File Size Changes (1 files changed, -0.0% overall, 0↑ 1↓)

Benchmarks: Compression
Vortex (geomean): 1.005x ➖
unknown / unknown (1.009x ➖, 2↑ 4↓)

Ideally, we would have a download API in vortex-utils. We also download files in the vortex-duckdb build.rs and elsewhere. That can be done in a follow-up, which should also make all downloads atomic.
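
For reference, "atomic" here presumably means that a reader never sees a half-written file. A minimal sketch of one common way to get that, assuming a POSIX-like filesystem where a rename within one directory replaces the target in a single step: stage the bytes in a temporary sibling file, then rename it over the destination. `write_atomically` is a hypothetical name for illustration, not an existing vortex-utils function.

```rust
use std::fs::{self, File};
use std::io::{self, Write};
use std::path::Path;

/// Hypothetical helper (not part of vortex-utils): store `bytes` at `dest`
/// so that `dest` either keeps its old contents or holds the full new contents.
pub fn write_atomically(dest: &Path, bytes: &[u8]) -> io::Result<()> {
    // Build a temporary sibling name, e.g. `data.bin.<pid>.tmp`, in the same
    // directory so the final rename does not cross filesystems.
    let mut tmp_name = dest
        .file_name()
        .ok_or_else(|| io::Error::new(io::ErrorKind::InvalidInput, "destination has no file name"))?
        .to_os_string();
    tmp_name.push(format!(".{}.tmp", std::process::id()));
    let tmp = dest.with_file_name(tmp_name);

    if let Some(parent) = dest.parent() {
        if !parent.as_os_str().is_empty() {
            fs::create_dir_all(parent)?;
        }
    }

    // Write and flush the staged file, then move it into place.
    let result = (|| -> io::Result<()> {
        let mut file = File::create(&tmp)?;
        file.write_all(bytes)?;
        file.sync_all()?;
        fs::rename(&tmp, dest)
    })();

    // On failure, remove the staged file so no debris is left behind.
    if result.is_err() {
        let _ = fs::remove_file(&tmp);
    }
    result
}
```

An interrupted download then leaves at most a stray `.tmp` file, which a later run can overwrite, rather than a truncated file that looks complete.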
Benchmarks: TPC-H SF=1 on S3
Verdict: No clear signal (environment too noisy confidence)
datafusion / vortex-file-compressed (1.205x ➖, 0↑ 6↓)
datafusion / vortex-compact (1.109x ➖, 0↑ 4↓)
datafusion / parquet (1.194x ➖, 0↑ 8↓)
duckdb / vortex-file-compressed (1.124x ➖, 0↑ 0↓)
duckdb / vortex-compact (1.102x ➖, 0↑ 1↓)
duckdb / parquet (1.133x ➖, 0↑ 0↓)

Benchmarks: TPC-H SF=10 on S3
Verdict: No clear signal (environment too noisy confidence)
datafusion / vortex-file-compressed (1.032x ➖, 0↑ 0↓)
datafusion / vortex-compact (1.091x ➖, 0↑ 2↓)
datafusion / parquet (0.980x ➖, 0↑ 0↓)
duckdb / vortex-file-compressed (1.008x ➖, 0↑ 0↓)
duckdb / vortex-compact (1.029x ➖, 0↑ 0↓)
duckdb / parquet (1.108x ➖, 0↑ 2↓)

Stop repeating the same download logic all over the codebase.
There is still a slight variation of this logic in the statpopgen VCF download.
Signed-off-by: Robert Kruszewski <github@robertk.io>
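
To illustrate the kind of duplication the commit message describes, here is a minimal sketch of a single "download if missing" helper that benchmark data loaders could share. It is an assumption about the general shape, not the code in this PR: `ensure_file` is a hypothetical name, and the `fetch` closure stands in for whatever HTTP client the real code uses.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Hypothetical shared helper: return `dest` right away if it already exists,
/// otherwise call `fetch` once and store the bytes it returns.
pub fn ensure_file<F>(dest: &Path, fetch: F) -> io::Result<PathBuf>
where
    F: FnOnce() -> io::Result<Vec<u8>>,
{
    // A previous run already produced the file, so skip the download.
    if dest.exists() {
        return Ok(dest.to_path_buf());
    }
    // Create missing parent directories before writing.
    if let Some(parent) = dest.parent() {
        fs::create_dir_all(parent)?;
    }
    let bytes = fetch()?;
    fs::write(dest, bytes)?;
    Ok(dest.to_path_buf())
}
```

Passing the transfer step as a closure keeps the existence check and directory creation in one place, while each dataset supplies only its own URL or source.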